Search CORE

EndoNet: an information resource about regulatory networks of cell-to-cell communication†

Author: A. P. Potapov
Ananko
B. Goemann
Bader
Bard
E. Wingender
H. Michael
Heinemeyer
Hodges
J. Donitz
Joshi-Tope
Kanehisa
Kanehisa
Keseler
Krull
Lemer
M. Lize
Matys
N. Sasse
Salwinski
Wingender
Wu
Publication venue: Oxford University Press
Publication date
Field of study

EndoNet is an information resource about intercellular regulatory communication. It provides information about hormones, hormone receptors, the sources (i.e. cells, tissues and organs) where the hormones are synthesized and secreted, and where the respective receptors are expressed. The database focuses on the regulatory relations between them. An elementary communication is displayed as a causal link from a cell that secretes a particular hormone to those cells which express the corresponding hormone receptor and respond to the hormone. Whenever expression, synthesis and/or secretion of another hormone are part of this response, it renders the corresponding cell an internal node of the resulting network. This intercellular communication network coordinates the function of different organs. Therefore, the database covers the hierarchy of cellular organization of tissues and organs as it has been modeled in the Cytomer ontology, which has now been directly embedded into EndoNet. The user can query the database; the results can be used to visualize the intercellular information flow. A newly implemented hormone classification enables to browse the database and may be used as alternative entry point. EndoNet is accessible at: http://endonet.bioinf.med.uni-goettingen.de

Application of regulatory sequence analysis and metabolic network analysis to the interpretation of gene expression data

Author: A. Brazma
A.J. Enright
D. Gilbert
D. Thomas
E. Wingender
E.M. Marcotte
E.M. Marcotte
G. Reinert
H. Salgado
J. Helden van
J. Helden van
J. Helden van
J. Helden van
J. Helden van
J.H. Graber
J.L. DeRisi
M. Kanehisa
M. Pellegrini
M.B. Eisen
M.B. Eisen
P. Tamayo
P.D. Karp
P.O. Brown
P.T. Spellman
Publication venue: JOBIM
Publication date: 01/01/2000
Field of study

We present two complementary approaches for the interpretation of clusters of co-regulated genes, such as those obtained from DNA chips and related methods. Starting from a cluster of genes with similar expression profiles, two basic questions can be asked: 1. Which mechanism is responsible for the coordinated transcriptional response of the genes? This question is approached by extracting motifs that are shared between the upstream sequences of these genes. The motifs extracted are putative cis-acting regulatory elements. 2. What is the physiological meaning for the cell to express together these genes? One way to answer the question is to search for potential metabolic pathways that could be catalyzed by the products of the genes. This can be done by selecting the genes from the cluster that code for enzymes, and trying to assemble the catalyzed reactions to form metabolic pathways. We present tools to answer these two questions, and we illustrate their use with selected examples in the yeast Saccharomyces cerevisiae. The tools are available on the web (http://ucmb.ulb.ac.be/bioinformatics/rsa-tools/; http://www.ebi.ac.uk/research/pfbp/; http://www.soi.city.ac.uk/~msch/)

CiteSeerX

Brunel University Research Archive

DI-fusion

TRANSFAC(®) and its module TRANSCompel(®): transcriptional gene regulation in eukaryotes

Author: Barre-Dirrie A.
Chekmenev D.
Fricke E.
Hornischer K.
Kel A. E.
Kel-Margoulis O. V.
Krull M.
Land S.
Lewicki-Potapov B.
Liebich I.
Matys V.
Reuter I.
Saxel H.
Stegmaier P.
Voss N.
Wingender E.
Publication venue: Oxford University Press
Publication date: 28/12/2005
Field of study

The TRANSFAC(®) database on transcription factors, their binding sites, nucleotide distribution matrices and regulated genes as well as the complementing database TRANSCompel(®) on composite elements have been further enhanced on various levels. A new web interface with different search options and integrated versions of Match™ and Patch™ provides increased functionality for TRANSFAC(®). The list of databases which are linked to the common GENE table of TRANSFAC(®) and TRANSCompel(®) has been extended by: Ensembl, UniGene, EntrezGene, HumanPSD™ and TRANSPRO™. Standard gene names from HGNC, MGI and RGD, are included for human, mouse and rat genes, respectively. With the help of InterProScan, Pfam, SMART and PROSITE domains are assigned automatically to the protein sequences of the transcription factors. TRANSCompel(®) contains now, in addition to the COMPEL table, a separate table for detailed information on the experimental EVIDENCE on which the composite elements are based. Finally, for TRANSFAC(®), in respect of data growth, in particular the gain of Drosophila transcription factor binding sites (by courtesy of the Drosophila DNase I footprint database) and of Arabidopsis factors (by courtesy of DATF, Database of Arabidopsis Transcription Factors) has to be stressed. The here described public releases, TRANSFAC(®) 7.0 and TRANSCompel(®) 7.0, are accessible under

CiteSeerX

An intuitionistic approach to scoring DNA sequences against transcription factor binding site motifs

Author: A Sandelin
A Sandelin
A Sharov
A Tomovic
Adrian J Shepherd
Armando Blanco
C Lawrence
D Denning
E Baker
E Szmidt
E Wingender
F Garcia
F Lam
F Lopez
F Offner
F Zare-Mirakabad
Fernando Garcia-Alcalde
G Chamilos
G Diop
G Hertz
J Hanley
J Hughes
J Sainz
J Van Helden
J Zhao
K Atanassov
K Atanassov
K Atanassov
K Atanassov
K Won
L Liang
L Zadeh
M Bulyk
M Das
M Eisen
N Dror
N Kim
P Benos
P Bochud
P Schling
R Gordan
S De
T Bailey
T Fawcett
T Hehlgans
T Tamura
T Tamura
V Khatibi
W Hung
W Wasserman
X Chen
Y Haudry
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Background: Transcription factors (TFs) control transcription by binding to specific regions of DNA called transcription factor binding sites (TFBSs). The identification of TFBSs is a crucial problem in computational biology and includes the subtask of predicting the location of known TFBS motifs in a given DNA sequence. It has previously been shown that, when scoring matches to known TFBS motifs, interdependencies between positions within a motif should be taken into account. However, this remains a challenging task owing to the fact that sequences similar to those of known TFBSs can occur by chance with a relatively high frequency. Here we present a new method for matching sequences to TFBS motifs based on intuitionistic fuzzy sets (IFS) theory, an approach that has been shown to be particularly appropriate for tackling problems that embody a high degree of uncertainty. Results: We propose SCintuit, a new scoring method for measuring sequence-motif affinity based on IFS theory. Unlike existing methods that consider dependencies between positions, SCintuit is designed to prevent overestimation of less conserved positions of TFBSs. For a given pair of bases, SCintuit is computed not only as a function of their combined probability of occurrence, but also taking into account the individual importance of each single base at its corresponding position. We used SCintuit to identify known TFBSs in DNA sequences. Our method provides excellent results when dealing with both synthetic and real data, outperforming the sensitivity and the specificity of two existing methods in all the experiments we performed. Conclusions: The results show that SCintuit improves the prediction quality for TFs of the existing approaches without compromising sensitivity. In addition, we show how SCintuit can be successfully applied to real research problems. In this study the reliability of the IFS theory for motif discovery tasks is proven

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

Repositorio Institucional Universidad de Granada

UCL Discovery

Birkbeck Institutional Research Online

Quantitative model for inferring dynamic regulation of the tumour suppressor gene p53

Author: A Chipperfield
A Conesa
AR Joyce
AT Kwon
AW Braithwaite
C Moorman
CG Moles
CL Wei
D Chen
DG Sedding
E Wingender
G Liu
H de Jong
J Aach
J Goutsias
J Goutsias
J Wang
J Wang
J Wang
J Wang
JC Liao
JM Espinosa
Junbai Wang
K Zhu
KB Spurgers
KH Vousden
L Ma
M Barenco
MK Yeung
MR Bhonde
MV Karamouzis
N Sun
PS Kho
Q Wei
Q Wu
R Rahman-Roblick
RB Zhao
RC Gentleman
RS Erb
S Liu
S Rogers
SA Johnson
T Tian
Tianhai Tian
TS Gardner
TT Vu
WS el Deiry
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Background: The availability of various "omics" datasets creates a prospect of performing the study of genome-wide genetic regulatory networks. However, one of the major challenges of using mathematical models to infer genetic regulation from microarray datasets is the lack of information for protein concentrations and activities. Most of the previous researches were based on an assumption that the mRNA levels of a gene are consistent with its protein activities, though it is not always the case. Therefore, a more sophisticated modelling framework together with the corresponding inference methods is needed to accurately estimate genetic regulation from "omics" datasets. Results: This work developed a novel approach, which is based on a nonlinear mathematical model, to infer genetic regulation from microarray gene expression data. By using the p53 network as a test system, we used the nonlinear model to estimate the activities of transcription factor (TF) p53 from the expression levels of its target genes, and to identify the activation/inhibition status of p53 to its target genes. The predicted top 317 putative p53 target genes were supported by DNA sequence analysis. A comparison between our prediction and the other published predictions of p53 targets suggests that most of putative p53 targets may share a common depleted or enriched sequence signal on their upstream non-coding region. Conclusions: The proposed quantitative model can not only be used to infer the regulatory relationship between TF and its down-stream genes, but also be applied to estimate the protein activities of TF from the expression levels of its target genes

Directory of Open Access Journals

Enlighten

Vitamin D receptor ChIP-seq in primary CD4+ cells: relationship to serum 25-hydroxyvitamin D levels and autoimmune disease

Author: A Sandelin
A Sanyal
Adam E Handel
AE Handel
Antonio J Berlanga-Taylor
AP Boyle
B Langmead
B Lehmann
BE Bernstein
C Carlberg
CE Grant
CS Ross-Innes
CY McLean
D Berglund
E Wingender
F Birzele
Finn Drabløs
G Pavesi
Gavin Giovannoni
Geir K Sandve
George C Ebers
Giulio Disanto
Giuseppe Gallone
GK Sandve
Heather Hanwell
IV Kulakovskiy
J Orgaz-Molina
J-C Souberbielle
JHA Martens
K Li
KL Munger
LA Hindorff
LL Issa
M Ashburner
M Caliskan
M Lutz
M Thomas-Chollier
MA Kriegel
MD Shirley
ML McCullough
NU Rashid
O Weth
PA Fujita
PA Marshall
R Salehi-Tabar
RM Tolón
S Gundersen
S Heikkinen
Sreeram V Ramagopalan
SV Ramagopalan
T Liu
TA Owen
TL Bailey
TL Bailey
TL Bailey
Y Zhang
Y-C Huang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

PMCID: PMC3710212This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited

Oxford University Research Archive

Spiral - Imperial College Digital Repository

Queen Mary Research Online

NORA - Norwegian Open Research Archives

Efficient and accurate P-value computation for Position Weight Matrices

Author: A Liefooghe
C Pizzi
E Wingender
G Bejerano
GE Crooks
GZ Hertz
H Huang
Hélène Touzet
J Zhang
Jean-Stéphane Varré
JM Claverie
K Malde
M Beckstette
M Garey
R Staden
S Mount
S Rahmann
TD Wu
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Position Weight Matrices (PWMs) are probabilistic representations of signals in sequences. They are widely used to model approximate patterns in DNA or in protein sequences. The usage of PWMs needs as a prerequisite to knowing the statistical significance of a word according to its score. This is done by defining the P-value of a score, which is the probability that the background model can achieve a score larger than or equal to the observed value. This gives rise to the following problem: Given a P-value, find the corresponding score threshold. Existing methods rely on dynamic programming or probability generating functions. For many examples of PWMs, they fail to give accurate results in a reasonable amount of time. Results The contribution of this paper is two fold. First, we study the theoretical complexity of the problem, and we prove that it is NP-hard. Then, we describe a novel algorithm that solves the P-value problem efficiently. The main idea is to use a series of discretized score distributions that improves the final result step by step until some convergence criterion is met. Moreover, the algorithm is capable of calculating the exact P-value without any error, even for matrices with non-integer coefficient values. The same approach is also used to devise an accurate algorithm for the reverse problem: finding the P-value for a given score. Both methods are implemented in a software called TFM-PVALUE, that is freely available. Conclusion We have tested TFM-PVALUE on a large set of PWMs representing transcription factor binding sites. Experimental results show that it achieves better performance in terms of computational time and precision than existing tools.</p

HAL - Lille 3

Directory of Open Access Journals

INRIA a CCSD electronic archive server